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ABSTRACT 

Recent measurements of the autocorrelation function of the Lya clouds are analyzed 
from the point of view of a simple model with strong clustering on the small scales. 
It is shown that this toy model reproduces fairly well the important linear relation 
between amplitude of the absorber autocorrelation function and neutral hydrogen 
column density. In addition, it predicts a correct evolutionary trend of correlation 
amplitudes. Some possible ramifications of these results are discussed. 
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galaxies: evolution 



1 INTRODUCTION 

The problem of clustering of the Lya forest absorbers has 
been discussed in the course of the last two decades by many 
authors (Sargent et al. 1980; Dekel 1982; Salmon & Hogan 
1986; Webb 1987; Ostriker, Bajtlik & Duncan 1988; Crotts 
1989; Heisler, Hogan & White 1989; Liu & Jones 1990; Webb 
& Barcons 1991; Barcons & Webb 1990, 1991; Fang 1991; 
Mo et al. 1992; Fardal & Shull 1993; Srianand & Khare 1994; 
Chernomordik 1995; Elowitz, Green & Impey 1995; Meiksin 
& Bouchet 1995; Carbone & Savaglio 1996; Srianand 1996; 
Ulmer 1996; Fernandez-Soto et al. 1996; Lanzetta, Webb & 
; Barcons 1996; Pando & Fang 1996; Cen & Simcoe 1997; 
Rauch 1998). The conclusions of the entire effort are still 
controversial, since the original paradigm that Lya clouds 
do not show no clustering at all (Sargent et al. 1980; Barcons 
& Webb 1990), was somewhat undermined by findings of 
weak clustering of intermediate-redshift Lya clouds (Webb 
1987; Barcons & Webb 1991; Webb & Barcons 1991; Cher- 
nomordik 1995), and seriously questioned by the work on 
associated C IV absorption (Cowie et al. 1995; Fernandez- 
Soto et al. 1996; see also Songaila & Cowie 1996). In an 
interesting work, Crotts (1989) has investigated correlations 
in the real space across the sky among systems in multi- 
ple lines-of-sight, and not only detected small-scale cluster- 
ing, but also established the increase of clustering ampli- 
tudes with increasing column density, the conclusion which 
we shall quantify below. Difficulties and limitations inherent 
in any attempt to use the two-point correlation function to 
deduce the properties of the Lya forest are summarized in 
Rauch et al. (1992) and Fernandez-Soto et al. (1996). The- 
oretical analysis of Dekel (1982) was very interesting in this 
respect, since it weakened the dominant paradigm of Sar- 



gent et al. (1980) — which, as we have seen, dictated much of 
the future development of absorption-line studies — where it 
looked strongest: in the cosmological part of the argument. 
As Dekel writes: "My aim is to point out a cosmological sce- 
nario in which galaxies are clustered only weakly at z > 1.7, 
so that the LACs [Lya clouds] may cluster just like galax- 
ies. Here both the isothermal and the truncated adiabatic 
components of the density perturbations play a role in the 
formation of structure in the universe." It is also quite in- 
teresting to note that Dekel (1982) was the first to suggest 
usage of C IV lines to measure the degree of clustering of 
absorption systems, idea which was fully realized only 14 
years later by Fernandez-Soto et al. (1996). It should cer- 
tainly be mentioned that low spectral resolution of most of 
existing measurements (e.g. 250 — 300 km s _1 for the HST 
Key Project; see Jannuzi 1997; Jannuzi et al. 1998) makes 
investigations on small scales exceedingly difficult. And it is 
exactly these scales which are, because of their discrimina- 
tive power, the most interesting from our point of view. 

Webb (1987) was the first to point out the presence of 
weak clustering at small velocity scales, based on the Voigt 
profile fitting in the high-redshift data, which was confirmed 
by other investigations (e.g. Muecket & Mueller 1987). As 
emphasized by Fernandez-Soto et al. (1996), it is very diffi- 
cult to directly detect clustering of the Lya lines because of 
the short redshift path length in any individual QSO spec- 
trum, and line blending. If a significant fraction, or most of 
the observed Lya absorption lines are in fact blends of very 
narrow components, detected amplitudes of the autocorre- 
lation function would be significantly underestimated (see 
also a discussion in Rauch et al. 1992). 

On the basis of these early findings, Ostriker et 
al. (1988) first proposed gravitationally induced clustering 
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as one of the possible explanations for excess of pairs of 
absorbers at small (and in their sample, intermediate^]) ve- 
locity splittings. However, in a recent important study, Cris- 
tiani et al. (1997), performed the most comprehensive anal- 
ysis of the clustering of Lya clouds in a sample of about 
1600 absorbers along 15 lines of sight, and concluded that 
the small-scale clustering of Lya absorbers (a) is real, and 
(b) can be understood in terms of gravitationally-induced 
clustering, in the manner of Ostriker et al. (1988). Paren- 
thetically, the existence of structure in the Lya forest was 
confirmed by independent methods aimed at detection of 
the deviation of spatial distribution of absorbers from a uni- 
form random one; thus Fang (1991) showed that Lya forest 
deviate from a uniform distribution at 3<r significant level. 
This is, of course, still weaker from the non-uniformity seen 
among the known galactic population, but very different 
from the picture of uniform, diffuse intergalactic population 
envisaged by Sargent et al. (1980). 

Another recent work of great importance for the de- 
velopment of our ideas on the spatial distribution of Lya 
clouds is that of Ulmer (1996), who investigated a sam- 
ple of low-z Lya lines recently obtained with the HST Key 
Project (Bahcall et al. 1996). Results of that work are par- 
ticularly significant, since they demonstrate the existence 
of strong clustering of Lya lines at velocity separations at 
which high- z lines seem completely unclustered, and which 
represent an intermediate regime between the small-scale 
(Aw < 200 km s _1 ) and large scale clustering. In physical 
terms, we can hypothesize that the small-scale regime can 
be plausibly explained as characteristic of the intragalactic 
motions in typical L ~ L* galaxies, being on the same or- 
der as velocity dispersion in the known galactic subsystems 
(Binney & Tremaine 1987). On the other hand, large-scale 
clustering is generally believed to trace large-scale struc- 
ture (i.e. structures with velocity dispersions similar to that 
in rich galaxy clusters and higher). Ulmer (1996) has not 
obtained any information on the small velocity splittings, 
since his method explicitly rejects velocity splittings with 
Av < 250 km s _1 . Still, its implications are important be- 
cause of the very strong signal found for 250 < Av < 500 km 
s~\ which, if extrapolated into Av < 250 km s _1 agrees well 
with results of Cristiani et al. (1997) and those discussed in 
further text. Similar excess of absorber pairs for Av ~ 200 
km s _1 was found by Srianand & Khare (1994) at the ~ 4<r 
level. The redshift evolution of clustering is also correctly 
emphasized in Ulmer (1996), who inferred a substantial in- 
crease in the degree of clustering of the Lya forest. As we 
shall see, there is evidence that the same trend is real over 
most of the history of the universe. 

In the rest of this paper, we shall attempt to show that 
the results of Ulmer (1996) and Cristiani et al. (1997) for 
the amplitude of TPCF are consistent with simple model 
characterized by constant small-scale clustering. Specifically, 
we shall show that (i) the linear relation between column 
density of clouds and their autocorrelation amplitude, and 
(ii) the general evolutionary trend of decrease in cluster- 
ing with increasing redshift, are successfully explained in 
such a toy model. Following two sections are, therefore, de- 



* which has not been confirmed afterwards; see the discussion in 
Chernomordik (1995). 



voted to these two important issues. Although not specifi- 
cally endorsing such a simplistic approach, it does make the 
complex explanations for the observed TPCF properties, in- 
volving biasing for the structure formation and gravitation- 
ally induced correlations, unnecessary. In contradistinction, 
models in which the Lya forest is locally decoupled from 
the Hubble flow and physically associated with collapsed 
structures (e.g. galaxies) predict, in general, exactly such a 
behavior. 



2 A SIMPLE MODEL 

Let the Lya cloud distribution function (neglecting the 
Doppler parameter dependence) be written in the standard 
form as (e.g. Lu et al. 1996): 



F(N,z) 



dn 



dz cW H 



= A {l + zYN B 



(1) 



Constants in equation ([j]) were measured by various authors 
(Hu et al. 1995; Lu et al. 1996; Kim et al. 1997). For our 
purpose, it is enough to take approximate values 7 = 2.75, 
and (3 — 1.55 for the indices of redshift and column density 
distribution respectively. One of the ways to simplify rela- 
tion (Q) is to consider only column density dependence in 
a sufficiently large sample of absorbing lines. This column 
density distribution function we shall denote by f{N), and 
its standard functional form as 



/(TV) = BN~ 



(2) 

9.2 x 10 8 (Lu et 



Normalization for f(N) is given as B 
al. 1996). 

Let us consider a simple model in which absorbers along 
the line of sight are clustered around given points along the 
line of sight with small-scale clustering described by 4>{ v ) m 
the form of the step function, such that 



4>{v) 



= const. : v < a max 

: V > CT ma x 



(3) 



Here, cr max is the maximum total velocity dispersion char- 
acteristic for Lya absorption systems, i.e. both intragalactic 
and intergalactic, although at this stage its physical origin is 
not crucial. Following Fernandez-Soto et al. (1996), we shall 
take a fiducial value <r max = 150 km s _1 (see also Crotts 
1989; Mo et al. 1992). 

The TPCF is, in general, defined by the probability 



dP = (1 + £)n a dv. 



(4) 



For our simple model of clouds concentrated around given 
points along the line of sight the differential probability of 
finding another cloud at velocity separation dv is simply 



dP — (f>(v)dv + n a dv. 



(5) 



In these relations, n a is the average absorber density along 
the entire line of sight given as 



n a (N mia ,z)= J jF(N,z)dNdz. (6) 

Consistency requires that probabilities in eqs. (^J) and (|H]) 
are equal. It immediately follows that 
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0(») 



4>{v 



r 

Jn 



which gives the amplitude of TPCF as a function of thresh- 
old column density iV m i n . Assumption here is that the col- 
umn density distribution function stays the same at all red- 
shifts (i.e. along the entire line of sight), enabling us to sup- 
press the epoch dependence in (Q) . Notice that in this model 
the quantity <j>(v) is determined by the extrapolated unity 
column density correlation through relation 

.0-1 



log f (log N = 0) = log tj>(v) + log ■ 



B 



(8) 



Note that up to this point it does not matter whether <j>(v) 
is constant within some velocity range, as we supposed in 
eq. ([]), or not. From the mathematical point of view, this 
hypothesis remains unnecessary; however, in order to estab- 
lish firm contact with correlation observations of necessarily 
very limited velocity resolution, we shall henceforth explic- 
itly assume 4>{v) = 4> for small velocity splittings. (Another 
reason, as we shall see, becomes manifest when the redshift 
evolution of clustering is investigated.) 

This is certainly the simplest conceivable model of the 
small-scale clustering: we have taken everything constant, 
except for the absorber number density. Calculation per- 
formed using above listed numerical values of various param- 
eters of the distribution function and the data set of Cris- 
tiani et al. (1997) shows that (log £, log iV m i n ) curve is very 
well approximated by a linear dependence giving a value of 



log cj) = 0.16. 



(9) 



This result is shown in Figure 1. Obvious correlation is ac- 
counted for by our toy model under the assumptions dis- 
cussed above. The theoretical slope a t h of the linear fit 
y — ax ± b is determined just by the index of the column 
density distribution 



0-1 =0.55 ±0.05, 



(10) 



(if we consider the best fit of Lu et al. [1996] as reliable), 
independent of <j>. We see that the empirical slope 



0.64 ±0.06, 



(11) 



is equal to the prediction (|icj) within uncertainties. This fact 
lends a strong support to our hypothesis. 

The value of <fi in eq. ^ should be regarded as the 
lower limit for small-scale clustering, since it includes the 
lowest column density point in Cristiani et al. (1996) data, 
corresponding to the column density below the break in the 
distribution (Hu et al. 1995), for which not only should the 
different value of the exponent in the distribution function 
used in evaluating <f), but the very question of the possibility 
of the interpretation of these, lowest column density systems 
in the framework of our model is doubtful. In Figure 1 we 
see that a linear fit corresponding to constant clustering on 
the small scales is quite satisfactory with the significance 
of 84%. It may be noticed that the only significant non- 
linearity appears at the smallest column densities, where a 
diffuse, truly intergalactic, population is expected; our sim- 
ple model does not apply to these clouds (Fernandez-Soto et 
al. 1996; Gnedin & Hui 1996; Weymann et al. 1998). Very 
low column density Lya forest is likely to belong to different 
population of cosmological objects (Hernquist et al. 1996; 
Bi & Davidsen 1997; Rauch 1998). This is in accordance 




Figure 1. The dependence of the TPCF amplitude (per 100 km 
s~ x ) on the column density of Lya and C IV (two highest column 
density points) clouds. We see that a linear fit corresponding to 
constant clustering on the small scales is quite satisfactory with 
the significance of ~ 84%. It may be noticed that the only sig- 
nificant non-linearity appears at the smallest column densities, 
where a diffuse, truly intergalactic population is expected; the 
toy model is inapplicable to these clouds. 



with findings of Chen et al. (1998) that column density of 
halo clouds sharply declines with galactocentric distance; 
since that study also establishes a well-defined maximal ra- 
dius for absorption (J? m ax ~ 174 h~ x kpc for L, galaxies), 
it is clear that there is a threshold column density, below 
which clouds can not be associated with galaxies. It is ob- 
vious that the log Am = 13.30 cm -2 point shows the poor- 
est agreement with the linear fit, and excluding it from the 
fit gives unchanged slope a' culp = 0.59 ± 0.06 (showing a 
satisfactory stability of our model). Although the value of 
log£(logiV = 0)' is still within uncertainties equal to the 
previous value, the central value of <f) is, however, different 
by the factor of about 4, since we are dealing with the unfor- 
tunate near-cancellation of two large factors. If this lowest 
column density point is discarded, the resulting linear fit is 
significant on the » 96% level. Although the number of data 
points is small, the general conclusion is that the more re- 
alistic fit will tend to give larger values of 6 and hence the 
stronger clustering than that given by eq. (H). 



3 REDSHIFT EVOLUTION 

A trend of decreasing clustering with increasing redshift may 
also be explained by small-scale constant clustering model. 
Since the number density of the absorbers counted from any 
fiducial column density N m i a upward increase as 

^<x (! + *)<, 
dz 



(12) 



we could expect from the Equation Q that in a fixed veloc- 
ity bin, the TPCF amplitude will behave as inverse number 
density of absorbing clouds, i.e. 



£oc(l + z) 



-C-l 



(13) 
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1+z„ 



Figure 2. The rcdshift dependence of the TPCF amplitude in the 
first (100 km s -1 . With z mcd we denote the median redshift in 
each of the three rcdshift bins of Cristiani et al. (1997). This trend 
is not noticed at v > 200 km s -1 , which can be accounted for, 
since only very weak clustering is expected above some maximum 
velocity dispersion <7 ma x, which value is set by the physics of 
extended gaseous haloes. 



i.e. decrease with increasing redshift. Again, this is valid 
for fixed v < Umax. Although the data presented in Fig- 
ure 2 are certainly insufficient to achieve firm conclusions in 
this regard, they are nevertheless suggestive. We notice the 
decrease in the TPCF amplitude quite clearly in the first 
(v = 100 km s _1 ) bin, much less pronounced in the sec- 
ond (keep in mind that we set <T max = 150 km s _1 ), and 
completely nonexistent for larger velocity separations. It is 
very difficult to infer any quantitative relation from data 
as such, but we note that the observed decrease between 
the first and the third redshift bin at 100 km s~ separa- 
tion is within a 20% from the theoretical value produced 
by the simple model, using Kim et al. (1997) value for the 
high-0 Lya clouds £ = 2.78 ± 0.71 (but uncertainties are 
quite large). The main conclusion that clustering decreases 
with increasing redshift is incompatible with those classi- 
cal intergalactic models of Lya clouds in which the Hub- 
ble expansion and evolution of metagalactic background are 
only forces driving evolution of absorbing material, such as 
pressure-confined models of Sargent et al. (1980) or Ostriker 
& Ikeuchi (1983). The data point with 1 + z me d = 1.7 from 
Ulmer (1996) is included in Fig. 2, although it corresponds 
to larger velocity scales and is not directly comparable to 
the other data points. Our motivation here is that it may be 
regarded as a lower limit for the region of interest; as Ulmer 
(1996) noted: "...However, the lines appear to be strongly 
correlated, and the number of expected unresolved pairs 
(with Av < 230 km s _1 ) is at least as large as the total 
number of resolved pairs with 230 km s _1 < Av < 460 km 
s _1 ." This is certainly to be expected if clouds are physically 
associated with galaxies, since we confidently know that cor- 
relations of the known galactic population were smaller in 
the past. Since the large galaxy surveys of our time have be- 
come available, such investigations were performed several 
times (Infante & Pritchet 1992; Bernstein et al. 1994) with 
clear result: amplitude of the small-scale clustering increased 



by a factor of ~ 2 from the z = 0.3 epoch to the present 
epoch (z < 0.1). It should be emphasized that the obsta- 
cles in precise quantification of this effect are enormous, as 
discussed in detail in the study of Bernstein et al. (1994). 
Also, but less significantly, the theoretical work on N-body 
simulations (e.g. Yoshii, Peterson & Takahara 1993) came to 
the same conclusion about the general trend of the galaxy 
autocorrelation evolution. Thus, increased clustering of the 
Lya absorbers may be better understood in framework of 
the same physical processes which govern the evolution of 
clustering of normal, luminous galaxies. The same general 
trend of increasing clustering with decreasing redshift is in- 
dicated by the data on Lya forest in the HDF-S (Savaglio 
et al. 1999). 

Additional argument in favor of this simple picture 
comes from considerations of influence of the absorbing 
cloud size on the correlation amplitudes. As correctly 
pointed out by Cristiani et al. (1997), a spatial correlation 
function convolved with velocity dispersion produces a cor- 
relation function in the velocity space similar to what is 
observed if a cloud sizes of ~ 7.5 h^ 1 kpc irrespectively of 
redshift are assumed. We propose that this is quite realistic 
situation, and that realistic velocity dispersions require sim- 
ilar, or even smaller sizes, quite in accordance, for example, 
with the sizes obtained by the two-phase gas halo models 
of Mo (1994) and Mo & Miralda-Escude (1996), Chiba & 
Nath (1997) or Miyahata & Ikeuchi (1995). Such clouds, 
having total masses ~ 10 7 Mq are similar to progenitors 
of the present-day globular clusters. It is indicative that a 
decrease in the dominant velocity dispersion scale, which is 
allowed by all available empirical data (both from autocor- 
relation measurements and investigations of close pairs of 
lines of sight) will result in decrease in sizes of individual 
contiguous clouds. 



4 DISCUSSION 

This simple picture is what is generally expected from clouds 
residing in haloes dominated by dark matter, which are 
plausible physical candidates for our points around which 
absorbers are clustered with amplitude <f>(v). In physical 
terms, the dependence of absorbing column density on dis- 
tance from the center of an Lb galaxy (Chen et al. 1998) 



l0 Kl^ln^)=- 5 - 331 ^( 



+2.19 log 



10 kpc 
Lb 



+ 1.09, 



(14) 



implies that strong absorption will be seen only near the halo 
center. It is straightforward to conclude that these strong 
and rare absorption lines have a relatively large probability 
of having weaker companion lines originating in the same 
halo, i.e. within the galaxy velocity dispersion. Therefore, 
overall clustering strength is expected to increase with col- 
umn density. These absorption sites can be classical haloes 
of luminous galaxies or minihaloes (e.g. Meiksin 1994; Rauch 
1998). Hypothesis that at least a fraction of Lya clouds is lo- 
cated in extended haloes of luminous galaxies has found very 
strong support in low-redshift coincident studies (Lanzetta 



© 2000 RAS, MNRAS 000, 000-000 



Small-Scale Clustering 5 



ct al. 1996; Chen et al. 1998). In other words, the re- 
sults of the present small-scale clustering analysis presented 
support the general subclass of models with dark matter- 
dominated gravitational confinement. The theoretical basis 
of such models is given in important works of Mo (1994) and 
Mo & Miralda-Escude (1996), where the strongly physically 
motivated two-phase gaseous halo model has been developed 
in some detail. It is not clear at present whether some vari- 
ant of Black's (1981) classic self-gravitating confinement can 
also be cast in form which will satisfy the TPCF constraints, 
but it does not seem very promising. Contrariwise, the the- 
ories of origin of the Lya forest which link the absorption 
to larger systems, i.e. clusters or superclusters (Oort 1981; 
Doroshkevich 1984) are in clear disagreement with these cor- 
relation measurements, due to much higher velocity disper- 
sion of such structures. The same applies, as correctly noted 
by Srianand (1996), to the theories involving explosion- type 
processes (Ozernoy & Chernomordik 1978; Chernomordik & 
Ozernoy 1983; Vishniac & Bust 1987). On the other hand, 
strong anticorrelation between low- 2 Lya equivalent widths 
(i.e. H I column densities) and galaxy impact parameter 
in absorption-selected galaxy sample of Chen et al. (1998), 
indicates that these objects share intragalactic velocity dis- 
persions (i.e. the same velocity scales as discussed here). It 
is difficult, however, to distinguish between models with ex- 
tended gaseous haloes and huge disks of Maloney (1992). It 
should be mentioned that York et al. (1986) have argued 
that there are large hydrodynamic velocities observed in ab- 
sorption line systems which are similar to those seen in lines 
of sight through galaxies with active star formation. Along 
the same line of thought one should consider the finding of 
the HST Key Project (e.g. Boksenberg 1995) showing that 
the line density of Lya absorption systems is greater by 
nearly an order of magnitude in the vicinity of metal-line 
systems (which are believed to originate in haloes of normal 
galaxies). In the framework of our toy model, this could be 
interpreted as an observational justification for using negli- 
gible <f>(v) outside the range spanned by intragalactic veloc- 
ities (0 — 250 km s _1 ). In general, the conclusion that the 
redshift dependence of the correlation function amplitudes 
can discriminate between various models of Lya clouds has 
important and far-reaching consequences. 

Parenthetically, the lack of power in the Lya absorp- 
tion line TPCF amplitudes at larger scales in comparison 
with the TPCF of local galaxies need not, along the gen- 
eral proposition of Dekel (1982), necessarily be understood 
as indication for physical difference of absorber and galaxy 
populations. Instead, one may follow the suggestion of Fang 
(1991) whose results indicate that a biased clustering in the 
universe simply has not occurred yet at z ~ 2 on large (co- 
moving) scales. We should keep in mind that absence of 
large-scale clustering in the conventional Lya forest sam- 
ples has been based mainly on surveys of high- z absorption 
lines, usually with (z) > 2. Only large Lya forest surveys at 
intermediate and small redshift, likely to be available in the 
near future, will be capable of definitely solving the puzzle. 

We conclude that at this level of accuracy of the TPCF 
measurements, a simple model with large and constant 
small-scale clustering is able to account for available obser- 
vational evidence. Empirically well known galactic velocity 
dispersion seems to be capable of entirely explaining ob- 
served small-scale clustering properties of Lya clouds, in 



agreement with the Occam's razor. Much further theoret- 
ical work is certainly necessary in order to elaborate the 
details of the galactic halo model for Lya clouds. However, 
the fact that this model is naturally arising in a compelling 
theoretical picture such as the halo cloud model is quite re- 
markable. 
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